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to! . Abstract 

We revisit and prove some convexity inequalities for trace functions conjectured in the earlier 
part I. The main functional considered is 

m 

for m positive definite operators Aj . In part I we only considered the case q = 1 and proved the 
concavity of $p.i for < p < 1 and the convexity for p = 2. We conjectured the convexity of $p^i 
for 1 < p < 2. Here we not only settle the unresolved case of joint convexity for 1 < p < 2, we 
' are also able to include the parameter q > 1 and still retain the convexity. Among other things 

. this leads to a definition of an L^(L^') norm for operators when 1 < p < 2 and a Minkowski 

inequality for operators on a tensor product of three Hilbert spaces - which leads to another 
proof of strong subadditivity of entropy. We also prove convexity/concavity properties of some 
other, related functionals. 
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1 Introduction 

Let M„ denote the set of n x n matrices with complex entries, and let A* denote the Hermitian 
conjugate of A £ M„. For < p < oo, and A E M„, define 



\A\\q = {Tr[{A* Ay/^]) 



!/<? 
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For q > 1, this defines a norm on M„, but not for q < 1. Nonetheless, it will be convenient here to 
use this notation for all q > 0. Finally, define ||^||oo to be the operator norm of A. 

Let H„ denote the set of n x n Hermitian matrices, and let H+ denote the set of n x n positive 
semidefinite matrices. For any two positive integers m and n, we may identify operators on C^^C" 
with mn x mn matrices, and then we have the two partial traces Tri and Tr2: For any mn x mn 
matrix A, T12A is the m x m matrix such that for all m x m matrices B, 

Tr[(Tr2^)B] = Tr[A(S ® /„xn)] , 

and Tri, which "traces out" the first factor, is defined analogously. Later, this notation will be 
extended to tensor products of more factors in the obvious way, without further discussion. 
This paper concerns properties of the following trace functionals: 

(1) For any numbers p,q > 0, and any positive integer m, define on the m-fold Cartesian 
product of with itself by 



%,,{A„...,A^) = \\ (E7=i^P'/^ 



:i.ii 



(2) For any numbers p,q > 0, and any positive integers m and n, define ^'p^g on by 



%,,{A) = II (Tr2^n 



i/p 



(1.2) 



(3) For any fixed n x n matrix B and any numbers p,q > 0, define Tp^g on H+ by 



Tp^g{A) = Tr {B*APBy/P 



[1.3) 



We shall determine conditions on p and q under which these functionals are convex or concave 
(jointly, in the case of ^p,g)- 

1.1 THEOREM. For all 1 < p <2, and for all q >1, '^p,q and ^p,q are convex on H,^ and 
respectively, while is jointly convex on (11+)™-. 

For < p < q < 1, Tp^g and ^p^q are concave on and H+„ respectively, while ^p^q is jointly 
concave on (H^)™". 

For p > 2, none of these functions are convex or concave for any values of q ^ p. 

1.2 Remark. Note that Tp^g is homogeneous of degree q > 1. By a general argument (Tp^^)^^"^ is 
also convex/concave. (The general argument is this: A function / that is homogeneous of degree one 
is convex (concave) if and only if the level set {x : f{x) < 1} ( {x : f{x) > 1}) is convex. Hence, 
if g{x) is homogeneous of degree g, and convex (concave), so that {x : g{x) < 1} {{x : f{x) > 1}) 
is convex, g^^'^ is convex (concave).) 

Apart from the elementary case p = 2,q = 1 which is discussed and proved in [3], the parts of 
this theorem that refer to convexity are new. The parts of this theorem that refer to concavity are 
already known in the case q = 1, but are new for other values of q: For < p < 1, the concavity of 
Tp^i is a theorem of Epstein [6], while the concavity of ^p,i and joint concavity of <I>p_i was proved 
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in [3]. Also in [3], the convexity of ^p^i and ^p,i for 1 < p < 2 was conjectured. In Theorem ll.lt 
this conjecture, and more, is verified. 

We also note that Bekjan [3] has proved the concavity of i for —1 < p < 0. While our interest 
here is in positive values of p, we shall return to Bekjan's work towards the end of the paper when 
we summarize the relations among these various convexity and concavity results. 

We have stated these inequalities for matrices, but as none of them refers to the dimension, it 
is easy to extend them to operators on a separable Hilbert space 7i, and we take this for granted 
in the next paragraphs. There are other, much more interesting extensions to be considered: For 
example, in [S], among other things Hiai has extended the concavity results from [5] to a von 
Neuman algebra setting in which the trace is replaced by a more general state. In this regard see 
also the paper pTj by Kosaki. 



1.1 Application to non-commutative Minkowski inequalities 

As an application of Theorem ll.il we deduce a tracial analog of Minkowski's inequality for multiple 
integrals: Let {X, d/i) and (Y, di/) be sigma-finite measure spaces, and then for any non-negative 
measurable function f on X x Y, and any 1 < q < p, 



Y 



X 



i/p 



< 



X 



l/p 

fP{x,y)du] d/z 



Y 



1/9 



For < p < g < 1, the inequality reverses. (When p = 1, one has the "standard" Minkowski 
inequality. This form involving p and q is obtained by applying the standard inequality for the L^/ 
norm to F{x,y) = f''{x,y).) 

Introducing a third measure space (Z, dA), we have the pointwise inequality 



j^y^nx,y,z)df,Y' du 



i/p 



< 



X 



i/p 

fP{x,y,z)du] dfi 



1/9 



(1.4) 



for any non-negative measurable function f on X x Y x Z, and any 1 < q < p- Now, raising both 
sides to the power q and then integrating over Z, one obtains 

q/p 



Y 



X 



\ P/9 

f'^{x,y,z)dfj,j du 



dX < 




z JX 



q/p 

fP{x,y,z)du\ dfidX 



Y 



:i.5) 



To formulate the corresponding tracial inequalities, let 7ij, j = 1, 2, 3 be three separable Hilbert 
spaces. Let Ti denote Tii Ti.2 Ti^, and let Tr^ be the partial trace that "traces out" the jth 
Hilbert space. 

Then the tracial analog of (jl.4p would be 



Tro 



p/g 



i/p 



< Tr 



q/p 



1/9 



(1.6) 



as an operator inequality on Hj,. When the dimension of equals 1, there are really only two 
spaces present (1 and 2), and then (jl.6p is the two space operator inequality proved in [3]. When 
dim('H3) > 1, (|1.6p cannot hold as an operator inequality, for any p > 1, even for g = 1, as explained 
in [3]. However, for 1 < < 2, the analog of (|1.5p is true no matter what dim('H3) may be: 
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1.3 THEOREM. Forl<q<p<2, and all positive operators A on ?^ = ?^l (g) ?^2 O ^3, 

Trs (Xrs [(Tri^'?)^ ) < Trg (xn [(TVs^^^/P] ) (1.7) 
For ^ <p <1, and any q > p, this inequality reverses. 

For < p < 1, g = 1, and for p = 2, g = 1, this was proved in [3], while for 1 < p < 2 and q = 1 
it was conjectured in [4]. 

Note that for q = p = 1, we have an identity, and so we may obtain a new inequality by 
differentiating in p and q at p = q = 1. As shown in [4j, one obtains the strong subadditivity of 
the quantum entropy in this way. To state this result, first proved in [T3], we recall the definition 
of the quantum entropy: A density matrix p on a separable Hilbert space W is a positive operator 
on Ti. such that Tr{p) = 1. The entropy of p, S{p), is defined by 

S(p)=-Tr(plnp) . 

Since (|1.7p is an equality at p = g = 1, we can obtain inequalities by differentiating in p at 
p = q = 1. Since 

^T:(/)|^^^=Tr(plnp) , 

the resulting inequality will involve the entropies of various partial traces of p. We shall use the 
following notation for these: 

P123 = P P23 = Trip , Pa = TriTr2p 

and so forth. (That is, the subscripts indicate the spaces "remaining" after the traces.) 

Then differentiating in p at p = q = 1, we obtain the strong subadditivity of the quantum 
entropy [T3]. [12]: 

S{pi3) + S{p23) > S{pu3) + S{ps) , 

just as in [4j, except that there, only a left derivative was taken since the Minkowski inequality was 
then known only for for p < 1 and q = 1- 

1.4 Remark. Since we did not provide the details of the differentiation argument in our previous 
paper, we take the opportunity to do so here. The basic fact is that for a positive operator A, and 
e close to zero, 

A^+' = A + eAlnA + 0{e'^) . 

At least in finite dimensions, one can take a partial trace of both sides, and the resulting identity 
still holds. 

Applying this with A = p in finite dimensions, we compute 

TV2 (p^+^) = P13 + eTi2iplnp) + 0{e^) . 
Then, since to leading order in e, 1/(1 + e) is 1 — s, 

[TV2(pi+^)] = P13 + eTT2{plnp) - epis Inpig + 0{e^) . 
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Thus, 

TTi(TTs[TT2{p'+nY^^'^'^)=l-eS{p)+eS{pn) + 0{e^) . (1.8) 
In the same way, we find 

1^2 {pit') = P3 + eTV2 (/923 lnp23) + O(e') , 

and then 

Tra [Tr2 {p\t'f'''^^'^] = 1 - eS{p2^) + eS{p^) + 0{e^) . (1.9) 
Combining p.8p . p.9|) and the g = 1 case of Theorem II. 3t we obtain 

S{pi3) + 5(p23) > 5(P) + Sips) 

in the finite dimensional case, where there is no issue of uniformity in the 0(6^) remainders; The 
general case follows by finite dimensional approximation (see the appendix to [T3].) I 

1.2 Non-commutative L^(Lp) norms 

We point out that the convexity of ^p,q can be used to define certain non-commutative analogs 
of the L'^(U') norms: Let {X,dp) and {Yjdv) be sigma-finite measure spaces. Any non-negative 
measurable function / on X x y such that 



X \jy 



g/p 

F{x,y)du] dp 



< oo (1.10) 



may be regarded as a function on X with values in L^iY, dv), and then the quantity above defines 
the natural L'^ norm of this L^-valued function. 

Similarly, a non-negative operator A on the tensor product Tii ® 7^2 of two separable Hilbert 
spaces can be regarded as a non-commutative analog of such a function. The problem of construct- 
ing an L'^{U') norm was actively considered by several people in the operator space community. 
Pisier successfully found one solution to the problem, valid for all 1 < (7 < p. With 1/r := \/q — l/p 
Pisier's definition is 

= ^ inf P||2.||y||,||S||2. . (1.11) 

" " y I X={A®I)Y{B®I) 

Pisier proved this is a norm in ^17J. However, the definition does not bear such an obvious similarity 
to (jl.lOp . and it was an actively considered problem to decide if "^p^q is convex and whether this 
convexity could be used to define a non-commutative L'^[U') norm. The convexity is now established 
in Theorem 1 1 . 1 1 but . since ^p,q is defined only for positive operators, it does not directly define a 
norm - even for self-adjoint operators. (Note that we cannot simply replace X by \X\ since 
X 1-^ \X\ is not operator convex.) However, the following standard construction does give us a 
norm for self-adjoint operators when 1 <p <2 and all g > 1: 

= inf {^p^q{A)+mp^q{B)) . (1.12) 

X = A-B, 
A > 0, B > 
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This definition satisfies the triangle inequality because if we write X = A — B and Y = C — D for 
A, B,C,D> 0, then X + Y = {A + C) - {B + D) and 

+ =inf--- < ^p,g{A + C) + %,g{B + D) 

< ^p,,{A) + ^pJC) + ^pjB) + ^pjD) 

The second inequality above is the convexity of ^p,q, which, since ^p,q is homogeneous of degree 
one, amounts to subadditivity. 

If we now choose A, B,C and D so that '^p^g{A)+'^p^g{B) nearly equals |||^|||l9(lp) and ^'p^q(C) + 
^p^q{D) nearly equals we conclude 

111-'^ + '^\Ili{Lp) < 111-'^ III L9 (LP) + III ^ III L? (LP) ■ 



1.5 Remark. It is to be noted that A ^p^q{A) is not monotone, but the norm A — > |||A||| is 
monotone (by construction). Even for A > 0, it is not necessarily the case that |||A||| = ^pq{A) 
although |||A||| < ^p^q{A), in general. 

Now that one has an L'^{L^) norm defined on self adjoint operators A on Tii ® TC2, one can go 
on to define an L'^{U') norm for all operators A on 7ii®7i2, self adjoint or not, as follows: Consider 
the block matrix 

A 
A* 



A 



(1.13) 



which is a self adjoint operator on Tii TI2 C3 C^. For present purposes, we regard this Hilbert 
space as Hi (^2 C^). That is, when computing "^p q of a positive operator on Hi {H2 C^), 
Hi is the first factor, and H2 CS> is the second factor. We then define 







II^IIl'j(lp) = 2III-' 


III L9 (LP) , 



(1.14) 



with A and A related as in (jl.lSp 

Notice that if A happens to be self adjoint, and A = B — C where B and C are positive, then 



A 



B-C 
B-C 



B+C B-C 
B-C B+C 



B+C C-B 
C-B B+C 



is a decomposition of ^ as a difference of positive operators. That is, every decomposition of A as 
a difference of positive operators induces a corresponding decomposition of A. 



Moreover, with the unitary operator U defined by U 



1 
71 



/ 



/ 



B + C B-C 
B-C B+C 



U* = 2 


B 







and lA 







c 





B+C C-B 
C-B B+C 



U* = 2 


' C 










B 



Because the unitary operator U respects the particular structure of Hi ® {H2 ® 



B+C B-C 
B-C B+C 



U* 



2^ 



p,<i 



B 

c 



2i^p,qiB) + ^p,q{C)) 
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and similarly for the other term. Thus, 



p,q 



B + C 
B-C 



B-C 
B + C 



PA 



B+C C-B 
C-B B+C 



2{^,,,{B) + ^,,,{C)) 



It therefore follows, for self-adjoint A, that 



|^||l'3(Lp) < 



LI {LP) 



In fact, there are many more ways to decompose ^ as a difference of two positive operators besides 
just those induced by a decomposition of A. Therefore, in general we can expect the last inequality 
to be strict. 

We note that we could have proceeded slightly differently: For the purpose of computing '^p^q of 
a positive operator on Tii TL2 C^, we could have regarded this Hilbert space as (C^ 03 Wi) ^2- 
This provides a second, distinct, way to construct the norm for general operators. It is an interesting 
open problem to compare the norms constructed this way using ^p^q with Pisier's non-commutative 
L'^{U') norm defined in (jl.lip . 



1.3 Organization of the paper 

In the next sections, we give the proof of Theorem II. 1[ The key will be to prove the convexity and 
concavity properties of Tp^g, which amounts to a generalization of Epstein's Theorem. This will be 
done in Section 2. 

Then, in Section 3, we shall be able to deduce the convexity and concavity properties of the 
other two functional from this using the same strategy that was used to deduce the corresponding 
concavity results from Epstein's Theorem in [3]. The proof of Theorem II. II concludes Section 3. In 
Section 4 we shall deduce Theorem 11.31 from the convexity and concavity properties of ^p,g, again 
using the same strategy that was employed in [1] in the case 0<p<l, q = 1. Finally, in Section 
5, we discuss relations among the various convexity and concavity results discussed here. Roughly, 
it turns out that all of the convexity statements are equivalent to one another: There is a chain of 
deduction starting from any one of them, and leading to any other. The same is true for concavity. 
Section 5 concludes with some brief historical comments. 

Acknowledgements We thank Frank Hansen and Mary Beth Ruskai for helpful comments on a 
draft of this paper. We also thank Quanhua Xu for a careful reading of the draft and for pointing 
out a number of misprints. 



2 Convexity and concavity properties of ^ 

As indicated at the end of the introduction, the main methodological novelty of the paper lies in 
this section. The proof of convexity of Tp^^ divides into two cases which are 1 < < p < 2 and 
1 < p < 2 with q > p. 

The next lemma treats the latter case, q > p, which is the easiest. 

2.1 LEMMA. For 1 < p <2 and q > p, '^p,q is convex on H^. 
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Proof: Since r := q/p > 1 and since B* A^B > 0, we can write 



\B*APB\\r = sup T^{B*APBY) 

\\y\\r' < 1, 
y > 



(2.11 



where 1/r + l/r' = 1. Since Ap is well known to be operator convex in A for 1 < p < 2, so is 
B*APB. Since the right side of (j2.ip is the supremum of a family of convex functions (note that 
y > is needed here) we conclude that is convex. (This quantity is the ^th power of 

Tp ,j(A), which is homogeneous of degree r > 1; therefore, by the Remark 11.21 after Theorem 
Tp^q{A) is convex.) ■ 
The case q < p cannot be treated by such elementary means, and the next three lemmas present 
the tools that we shall use. The first is a variational formula for pth roots. 

For all r > 1, and all c, x > 0, the arithmetic-geometric mean inequality says that 

1 



1 ^ r 
~^ + — 



-X > cx 



r-1 



and so 



l)x : X > 



r IX' 

One may then easily verify the corresponding formula for < r < 1: 



1 

c = - sup 

r 



— + (r — l)x : X > 



(2.2) 



(2.3) 



We shall develop this into a variational formula for Tp^g. For what follows, it is first useful to 
note that since B* A^B and A^^'^BB* AP/'^ have the same spectrum. 



Tp,g(^) = TV \{APl^BB*APlyiP 



(2.4) 



2.2 LEMMA. For any positive n x n matrix A, and with r = p/q > 1, we have, for any p 

1 



Tp,g(A) = ^inf|Tr 



AP/^B——B*AP/^ + (r- 1)X 



X 



r-l 



X>0 



where the infimum is taken over all positive n x n matrices X. Likewise, for r = p/q < 1, 



Tp^giA) = -sup<^TV 



AP/^B^--B*AP/'^ + (r - 1)X 



X>0 



where the supremum is taken over all positive n x n matrices X. 



(2.5) 



(2.6) 



Proof: Let C = B*Ap/'^. By continuity we may assume that C*C is strictly positive. Then, for 
r > 1, there is a minimizing X. Let Y = and note that minimizing (j2.5p with respect to X is 

the same as minimizing |Tr {CC*Y + (r — l)y~^/*^'~~^)) } with respect to Y . Since the minimizing 
Y is strictly positive, we may replace the minimizing y by y + tD, with D self adjoint, and set the 
derivative with respect to t equal to at t = 0. This leads to TrL'[CC* — y~''/(''~^)] = 0. Therefore 
we are done. The variational formula (j2.6p is proved in the same manner. ■ 

While these variational formulas may not appear to be very promising for our purposes since, 
in general, the infimum of a family of convex functions is not convex, they will be useful on account 
of the following lemma. 
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2.3 LEMMA. If f{x,y) is jointly convex, then g{x) defined by 

g{x) = mif{x,y) 
y 

is convex. The analogous statement with convex replaced by concave and infimum replaced by 
supremum is also true. 

This lemma is well known in the theory of convex functions (see Theorem 1 in [19]), but we 
recall the simple proof for completeness: 

Proof: For any xq and xi, and any < A < 1, pick e > 0, and choose yo and yi so that 
f{xo,yo) < 9{xo) + £ and f{xi,yi)<g{xi) + e. 

Then 

g{{l - X)xo + Xxi) < /((l-A)xo + Axi,(l-A)yo + A?/i) 

< (1 - A)/(xo,yo) + A/(xi,yi) 

< {l-X)g{xo) + Xg{xi) + 2e . ■ 

A special case of this lemma concerns the convex function on x given by f{x,y) = 
h{x — y)+g{y) in which h and g are convex on R". Then inf^ f(x, y) is called the infimal convolution 
of h and g. Its convexity accounts for the following physical fact: When one combines two physical 
systems and allows heat to flow between them so that the total energy x is conserved, the energy 
distributes itself so that the total entropy is maximized. Thus, the total entropy, given by 

S{x) = sup{S'i(x -y) + S2{y)}, 
y 

is again a concave function of the total energy x, as any legitimate entropy must be. 

On account of Lemmas 12.21 and 12.31 we can prove the stated convexity and concavity properties 
of Tp^q if we can prove that 

{A, X)^Ti (^AP/^B*j^BAP/^^ (2.7) 

jointly convex for 1 < r < p < 2, and jointly concave for < p < r < 1. In fact, for p = r = 2 it is 
known [15] that 

(C,A) ^ C*X-^C 

is even operator convex, which gives us more than we need in this case. 

To handle the general case, we need the following joint convexity result of Ando [I], and a 
concavity result in [12j : 

2.4 LEMMA. The map 

{A, B) ^ AP B^-'' 

on X H^, is jointly convex for all 1 < r < p < 2 (Ando's convexity theorem) and is jointly 
concave for all < p < r < 1 (Lieb's concavity theorem). 
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2.5 Remark. It is well known that the convexity /concavity stated in the lemma is equivalent to 
the convexity /concavity of {A, B) Th:A^K*B^^^' K for every matrix K G 1V[„. The difference 
between the two formulations is merely notational. A matrix Kij can equally well be regarded as 
a vector K^'"' in O C". Then TrAPK*B^-''K = Ap ® B^'"' K™'^). 

Proofs of both results can be found in [1]; see Corollary 6.2 for the concavity, and Corollary 
6.3 for the convexity. Next, we show that Lemma 12.41 provides the desired convexity and concavity 
properties of the map defined in (12. 7p . 



2.6 LEMMA. The map on x defined in (2.1) is jointly convex for all 1 <r <p <2 and 
is jointly concave for all < p < r < 1. 



Proof: We first rewrite the right hand side of (|2.7p in a more convenient form: Define 



Z 



A 
X 



and K 




B 



Then 



and so, using the cyclicity of the trace. 



B*X^-PB 




Tt{ZPK*Z^^''K) = Tr (^Ap/^B*j^BAp/^^ 



Note that convexity /concavity of the left hand side in Z is the same as convexity/concavity of the 
right hand side in (A,X). 

The relation between the left hand side and ZP ® Z^~'^ is explicated in Remark [ 

We are now ready to prove the main result of this section: 



2.7 LEMMA. For 1 < q < p < 2, Tp^g is convex on H+, while forO<p<q<l,T 
on H+. 



p,<i 



IS concave 



Proof: By Lemma 12.61 the mapping in (j2.7p is jointly convex for 1 < r < p < 2. Then taking 
r = p/q, we have from Lemma 12.21 and Lemma [ 



TpJA) = mff{A,X) 



X 



where f{A, X) is jointly convex in A and X. The convexity of Tp^g now follows by Lemma 
The concavity statement is proved in the same way. 



3 Convexity and concavity properties of and ^ 



3.1 LEMMA. For 1 < p < 2 and all q > 1, $p,g is jointly convex on (H+)'", while for < p < 
9 < 1, ^p,g is jointly concave on (H+)"^. 
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Proof: First, consider the case m = 2 and define 



A 









A2 



and 



I 
/ 



Then 



But 



A^ + aA^a 



Al + Al 
A\ + AP^ 



AP + aA^a = 2 



Now define 



AP 



2 

I±a 



+ 2 



(T 



and observe that these are complementary orthogonal projections. Therefore, the q/pth. power of 
the sum on the right is simply the sum of the q/pth powers, and hence 



2Tr 



{AP + A^y/P] = 29/fTV [(n+^Pn+)5/Pl + 2''/PTr \{u^APu^y^p 



(3.1) 



By Lemmas 12.11 and 12.71 applied with A = A and B = Il± , we see that each of the two terms 
on the right hand side of ()3.ip is a convex function of A, for l<p<2, q > 1, and concave for 
< p < q < 1. Of course, convexity /concavity in A amounts to joint convexity /concavity in Ai 
and A2. 

Apart from a factor of 2, the left hand side of (|3.1|) is the qth power of ^p^q{Ai, A2). Again, 
using Remark 11.21 and taking the gth root of the right hand side, we preserve the convexity and 
concavity properties since the right hand side is homogeneous of degree q. Hence ^p^q{Ai, A2) has 
the stated convexity and concavity properties. This concludes the proof for m = 2. 

One can easily iterate this procedure to obtain the result for all dyadic powers m = 2^ , and 
hence for all m. ■ 



We now turn to ^ 



3.2 LEMMA. For l<p<2 and q>l, ^p^q is convex on H+„, while for < p < q < 1, "^p^q is 



concave on 



H+ . 



The proof turns on Lemma 13.11 and a method for writing partial traces as averages, which 
originates with Uhlman [23]. To recall this, fix some orthonormal basis {ei, 62, ... , e„} of C". Let 
Q be the group of unitary operators W such that for some permutation vr of {1, . . . , n}, and some 
function s : 1,2, . . . , N ^ {0,1}, 



Wej = (-l)'(^)e^(j) for ah j 



(3.2) 



It is clear that these form a group, and that the cardinality of the group is 2"'n!. The point of 
the definition is that any operator on that commutes with every element of G is necessarily a 
multiple of the identity, and so, for any operator A on C", 



1 



n 



[Tr2(A)]®/„ 



1 



2"n! ^ 



[I^W*]A[I(E)W] 



(3.3) 
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Proof of Lemma 13. 2t Applying 13.31 we have 



Tr 



n^/P-^TriTVs 



n 



1 \ i/P 

-[Tr2^P]®/„,xn 

n 



2"n 



^ y [/ w*]AP [1 w] 



n 



, 1 



Upon taking the gth root, the result follows directly from Lemma l3.ll 



3.3 Remark. While the proof shows that the conclusion of Lemma 13.21 follows from Lemma |3.H 
the reverse implication is even more elementary: To see this, suppose that the matrix A in Lemma 
13.21 is the block diagonal matrix whose jth diagonal block is Aj. Then, clearly, 

^p,q{A) =^p,q{Al,A2,...,A^) . 

We next show that for p > 2, is neither concave nor convex, not even separately in each Aj. 
The argument is a simple adaptation of the one we gave in [4] for q = 1. However, the argument 
has other uses, and so it will be useful to give it here explicitly. 

3.4 LEMMA. For p > 2, ^p^q is neither convex nor concave, even separately, for any values of 
qj^p. 

Proof: The basis of the proof is a simple Taylor expansion. By simple differentiation one finds 
that for any A,B £ H+, 



%JtA,B) = \\B\L + -\\B\\l-'}TTAPB'^-P + 0(t'^P) 
p ^ 

Keeping B fixed, but replacing Ahy Ai, A2 and (^1 + A2)/2, we find 
^%,i{tAi,B) + hp,iitA2,B) - cl>p,i (t^^^^, B^ = 



(3.4) 



p 



-TV {AP^B'^-p) + -Tr (A^^^-^) - Tr 



+ 0{t^P) . 



Now if p > 2, ^ 1-^ is not operator convex, and so we can find Ai and A2 in and a unit 
vector V in C" such that 



\{v,AP,v) + \{v,APv) - U i^^^^'v 



(3.5) 



is strictly negative. However, since A ^ TxAp is convex, there is necessarily another unit vector 
so that if we replace v by w, this quantity becomes positive. 
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Hence for q > p, take B to be the rank one projection onto v. Then the bracketed quantity on 
the right hand side of (j3.5p is nothing other than (j3.5p . and so the left had side of (j3.5p is strictly 
negative. This shows that ^p^q cannot be convex for such p and q. Replacing v by w, the sign 
changes, and we see that it cannot be concave either. 

For q < p, take B = (el + vv*)~^ for some small value of e, so that B'^~p is essentially the 
orthogonal projection onto v, and then argue as before. ■ 

We have now completed all the steps needed for the proof of the main theorem: 

Proof of Theorem ll.lt Lemmas 12. 1112. 7 l 13. II and 13.21 respectivelv establish the convexity of Tp^g, 
^p^q and ^p,q for 1 < p < 2, q > 1, and the concavity of these functionals for < p < q < 1. Since 
we deduced the convexity /concavity properties of ^p^q from those of Tp g in the proof of Lemma 
13.11 it follows that if ^p^q is not convex or concave for some values of p and q, then neither is Tp^q. 
Likewise, in the remark following the proof of Lemma 13.21 we have observed that $p^g and ^p,q 
have the same convexity/concavity properties. Therefore, since Lemma [3.41 says that ^p^q is neither 
convex nor concave for p > 2, g 7^ p, the same is true of Tp^q and ^'p^g ■ 

4 Minkowski's inequality for trace norms on a product of three 
Hilbert spaces 

Proof of Theorem II. 3t First we reduce to the case q = 1: Suppose we have shown that 



for all positive A, and all 1 < p < 2. Then replacing A by A'^, and phy r = p/q, we obtain (jl.7p . 
The same works for the reverse inequality for < p < 1 except now we require q > p to have 
< r < 1. Therefore, it remains to prove (j4.ip . Consider first the case 1 < p < 2. 

Suppose that the dimension of TCi is n. The left hand side of ()4.ip can be written in terms of 



where the pair of spaces in the definition of is taken to be Tii (Xi Ti-s and TC2 ■ In the following, 
it will be useful to let I2 denote In^, to let Ii^s denote I'Hi'S'Hai and so forth. 
Then by (j3.3p and the convexity of ^p^q for 1 < q < p < 2, 



Trg [Tr2(TriA)P]i/^ < TYgTri {Tt^AP)'/^ 



(4.1) 





Now, by the definition of ^'p,i, and the fact that W is unitary, 

{[W* I2MA [W /2,3]) = TVi,3 [TT2 {[W* I2,3]A^ [W /2,3])] 
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However, 

Ti2 {[W* 12,3] [W 12,3]) = [W* l3]Tr2{AP) [W h] , 
and hence, again by the unitarity of W, 

[Tt2 {[W* l2,3]AP [W hM'^" = [W* ® h] [Tt2{AP)]^/p [W Is] ■ 

After taking the trace over TCi (X" "Hs, we obtain 

^-p,! {[W* I2,3]A [W l2,3]) = TrgTVi {TT2APf/P . 

Thus each term in the average in ()4.2p is the same, and we obtain (II. 7p . Note that since ^p,i is 
concave in the case < p < q < I, the inequahty in (14. 2|) reverses, and with it the inequahty in 
dLZl). ■ 

5 What follows from what 

In the chain of deduction leading up to the proof of Theorem II. H our starting point was Ando's 
convexity theorem and Lieb's concavity theorem quoted in Lemma 12.41 We then deduced convex- 
ity/concavity properties of Tp^g from this, and then convexity /concavity properties of g from 
those of Tp^q. We have already noted that the convexity/concavity properties of ^p^q and ^p,q are 
identical. 

It is interesting to observe that this chain of deduction can be brought full circle: Given the 
convexity of $p^i for 1 < p < 2, and the concavity for < p < 1, one can prove Ando's convexity 
theorem and Lieb's concavity theorem. This can be done using some ideas of Bekjan. In [3], Bekjan 
proved the joint concavity of <I>p.i for —1 < p < 0. He did this by first adapting Epstein's proof of 
his inequality to negative values of p, and then applying the method used in [4] to prove the joint 
concavity of ^p^i for < p < 1. 

His main interest in this result seems to have been the consequence that he derived from it by 
using the Taylor expansion ()3.4p at g = 1, where it simplifies to 



% i(tA,B) = TrB + -TrB^-PAP + 0(t^P) , (5.1) 
P 

for A,B £ H^. As Bekjan showed, since the constant term is affine, one can deduce from this 
formula and the joint concavity of {A,B) ^p^i{A, B) for —1 < p < the joint convexity on 
H+ X H+ of (A, B) ^ TtAP B^-^/p for 1 < p < 2. Now that we know that the joint convexity of 
{A,B) ^ ^p i[A, B) for 1 < p < 2, the argument can be made even more direct: 

5.1 LEMMA. Consider the maps {A, B) ^ $p,i(^, B) and {A, B) ^ TtAP B'^-'^/p on H+ x H+. 
For < p < g < 1 the joint concavity of {A,B) ^p^i(A, B) implies the joint concavity of 
{A,B) ^ TxAP B^^^/P . For 1 < q < p < 2 the joint convexity of {A,B) ^p,i{A, B) implies the 
joint convexity of {A,B) TrAP B^^^^p . 

Proof: It suffices to prove midpoint convexity/concavity. Fix Ai,A2 and Bi,B2 in H^, and for 
t > define 

fit) = ^-^p,,{tA,,B,) + ^^p,,{tA2,B2) - (t^^^, ^^4^) ^^-^^ 
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Note that /(O) = and that by TheoremlLll f{t) > (resp. < 0) for 1 < p < 2 (resp. < p < 1). 

We now apply the Taylor expansion (j3.4p to approximate each of the three terms in the definition 
of /. Since the constant term in the Taylor expansion is affine, it drops out (this is the reason for 
the restriction to q = 1 here), and consequently, to leading order in t, 



i-p 



+ -Tr 
2 



Tr 



(5.3) 



The quantity on the right that multiplies t^/p must have the same sign as f{t) for small t, which 
proves the convexity /concavity. ■ 

This leads to the conclusion that {A,B) TvA^B^^'^ is jointly convex/concave on H+ x H+ 
depending on the value of p. Bekjan further observed that one can "put the q back" into this 
inequality, thereby obtaining (with r = p/q) the more general result that {A,B) ^ TtAPB^~^ is 
convex for 1 < r < p < 2. Here is Bekjan's argument: Observe that < (1 — r)/(l — p) < 1, so 
that A I— > is concave. Also, — 1<1— p<0, sovli-^ A^~p is monotone non-increasing 

on H+. Thus, for any Bi, B2 G H+, and any A € (0, 1), 



(Ai?i + (1 - X)B2)^-^ 



[ASi + (l-A)B2]^'""^/^'~^^ 



i-p 



< 



XB 



(l-r)/{l-p) 
1 



+ (1 - A)S 



(l-r)/(l-p) 
2 



1-p 



Therefore, 



Tr 



(AAi + (1 - X)A2f (XBi + (1 - X)Br 



< Tr 



{XAi + {l-X)A2)nXBl 



M-r)/{l-p) 



+ (1 - X)B. 



(l-r)/(l-p) 



1-p" 



(5.4) 



Now applying the convexity of [A, B) 1-^ Tr^^i?^ Bekjan obtains 



Tr 



(AAi + (1 - X)A2Y (ABi + (1 - X)B2 

< ATr [A^^Bl 



1-r 



+ {l-X)Tr[AlBl- 



(5.5) 



which is the desired convexity conclusion. In the same way, one extends the concavity result to 
< p < r < 1. Bekjan was aware that the concavity result one obtains in this way is equivalent 
to Lieb's concavity theorem, but he was either unaware of Ando's convexity theorem, or unaware 
that his trace convexity theorem is actually equivalent to Ando's result. The equivalence is given 
by the following lemma: 

5.2 LEMMA. For any real numbers s,t these two statements are equivalent: 
(1.) {A, B) ^ Tr (A* 5*) is concave/ convex on x H+. 
(2.) (j4, B) ^ A^ ® B^ is operator concave/ convex x H+. 

Proof: (2.) easily implies (1.) because, as stated in Remark 12. 5| (2.) is equivalent to the 
concavity/convexity of {A, B) ^ Tr (^A'^K* B^K^ for every matrix K. To obtain (1.) simply take 
K = 1. 



CL November 30, 2007 



16 



To derive (2.) from (1.), it suffices to show that (1.) imphes that for each bounded operator 
K on Tl, {A, B) Tt (^A'^KB^K*^ is concave/convex, as explained in section [2j If IT is a unitary, 
U, this follows immediately from (1.) since (UBU*)*' = UB^U* and concavity /convexity in {A,B) 
is equivalent to concavity /convexity in {A,UBU*). To reduce the general case to the unitary case 
we may assume, without loss, that K is a contraction. Then K can be dilated to a unitary on the 

K L 

larger space 7i®7i. That is, there is an operator L such that W 



-L K 



is unitary, namely, 



L = U{I — l-ftTp)^/^, where K = U\K\ is the polar decomposition of K. Then, with A 



A 




and B 



B 




(5.6) 



Tr-H {A' K B* K*) = Trn^n W W*) , 
and the general case follows from the unitary case. ■ 

Thus, the joint convexity of {A,B) ^ Tt{AP B^^') for 1 < r < p < 2, proved in [3j is actually 
equivalent to Ando's convexity theorem, quoted in Lemma [2.41 This equivalence is significant for us 
because it brings our chain of implications full circle, back to the convexity and concavity theorems 
for tensor products. In Lemma 12.71 we deduced the convexity /concavity properties of Tp^g from 
those of the tensor products. In Lemma [XT] we deduced the convexity/concavity properties of $p^g 
from Tp q. In Lemma W% and the remark following it we proved that and ^p,q have the same 
convexity/concavity properties. 

The chain of implications is summarized in the following diagram, in which the labels on the 
arrows give the relevant lemmas. 



Lemma 15./ 

T¥ {AP B^-P/'i) 



Lemma 



Lemma 



^ V q (Remark after Lemma 3.2) 



5.1 Historical Remarks: 

We end this paper with some historical comments on the large literature on related matrix convexity 
and concavity theorems. This is by no means meant to be a complete survey. 

Many lines of investigation in this field can be traced back to the Wigner-Yanase papers [251 [26] 
that drew attention to the concavity oi A ^ lh:{AP K A'^~p K*) and proved it for p = 1/2. Their 
motivation was that in a quantum system, some states are easier to measure than others; if a 
density matrix p commutes with a conserved quantity (say the energy) then it is easy to measure, 
and otherwise not. Thus, while the von Neuman entropy of any pure state p is zero, some pure 
states have a higher information content than others - namely those that are not functions of the 
conserved quantities. In the presence of one conserved quantity, represented by the self-adjoint 
operator the Wigner-Yanase skew information of p is 



-Tr{[^p,Kf) 
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Note that this is 



I{p) = TtK^p - Tt^K^K . 



In [25], Wigner and Yanase hsted a number of requirements that they considered to be necessary for 
a measure of information. They conjectured these to hold for I{p). One of them was the convexity 
of /(p), and this was proved in [26] . Another was a certain subadditivity property, and this has 
only very recently been shown to be false by Hansen [7]. The hope that subadditivity would hold 
under additional assumptions was shown to be false by Seiringer [21] 

The general case Ti{A^KA'^K*) with p, g>0, p + q <1 (and K not necessarily self-adjoint) 
was done in |i.l2] in connection with a proof of strong subadditivity of entropy [H]. This concavity 
result was the impetus to several papers and several different proofs. The earliest after [12] was by 
Epstein [6] who derived the concavity, and more, by using the complex-analytic theory of Herglotz 
functions. Not only did he prove the concavity in |12j . he also proved the concavity of Tp^i for 
< p < 1. (Paper [6] also led to other results not directly related to our line of inquiry here.) 
Another one is Uhlmann's fundamental result [23] on the monotonicity of the relative entropy in a 
von Neuman algebra, building on Araki's extension [2] of the notion of relative entropy to this more 
general setting. Another useful extension of strong subadditivity and related concavity inequalities 
is in [IB]; this was applied in [22]. . 




Ando's work [T] was also partially motivated by [T2] and gave another proof of the concavity and 
many new concavity /convexity results, some of which are used here. His was the first proof using 
purely real variable methods. He realized that the fact that the geometric mean of two positive 
matrices A and B, namely A^/'^{A^^/'^BA^^/'^Y/'^A^/'^^ is a jointly concave function of A and 
as is the harmonic mean, namely 2{A~^ + B^^)^^ , is intimately related to the convexity /concavity 
properties of {A,B) ^ AP <^ B^'"' , or, equivalently, ofA^ Tr{APKA^-''K*). 

This result in [12] has been generalized in many ways, e.g., Kosaki [10] found new proofs based 
on interpolation theory that provided an analog of the concavity theorem in a general von Neuman 
algebra setting. 

Since the original proof of strong subadditivity of quantum entropy, there have been many others 
(see Ruskai's review [20j). One proof was given in our paper [4], in which strong subadditivity was 
derived from a Minkowski type inequality for traces, the 0<p<l,g = l version of Theorem 
II. 3i Another proof ^ that is relevant for the present paper came from recent results in the theory 
of operator spaces; this proof also uses a Minkowski inequality, but in Pisier's non-commutative 
L^{U') norm (jl.lip discussed in the introduction. One interesting remaining issue is the condition 
for equality in our convexity /concavity theorems, similar to the related issue of equality for strong 
subadditivity discussed in [20l |8] . 
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